Picture for Ivaxi Sheth

Ivaxi Sheth

PersistBench: When Should Long-Term Memories Be Forgotten by LLMs?

Add code
Feb 01, 2026
Viaarxiv icon

Funny or Persuasive, but Not Both: Evaluating Fine-Grained Multi-Concept Control in LLMs

Add code
Jan 26, 2026
Viaarxiv icon

ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols

Add code
Jun 09, 2025
Viaarxiv icon

Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models

Add code
Feb 28, 2025
Viaarxiv icon

Safety is Essential for Responsible Open-Ended Systems

Add code
Feb 06, 2025
Viaarxiv icon

MedG-KRP: Medical Graph Knowledge Representation Probing

Add code
Dec 17, 2024
Viaarxiv icon

CausalGraph2LLM: Evaluating LLMs for Causal Queries

Add code
Oct 21, 2024
Figure 1 for CausalGraph2LLM: Evaluating LLMs for Causal Queries
Figure 2 for CausalGraph2LLM: Evaluating LLMs for Causal Queries
Figure 3 for CausalGraph2LLM: Evaluating LLMs for Causal Queries
Figure 4 for CausalGraph2LLM: Evaluating LLMs for Causal Queries
Viaarxiv icon

LLM4GRN: Discovering Causal Gene Regulatory Networks with LLMs -- Evaluation through Synthetic Data Generation

Add code
Oct 21, 2024
Figure 1 for LLM4GRN: Discovering Causal Gene Regulatory Networks with LLMs -- Evaluation through Synthetic Data Generation
Figure 2 for LLM4GRN: Discovering Causal Gene Regulatory Networks with LLMs -- Evaluation through Synthetic Data Generation
Figure 3 for LLM4GRN: Discovering Causal Gene Regulatory Networks with LLMs -- Evaluation through Synthetic Data Generation
Figure 4 for LLM4GRN: Discovering Causal Gene Regulatory Networks with LLMs -- Evaluation through Synthetic Data Generation
Viaarxiv icon

Hypothesizing Missing Causal Variables with LLMs

Add code
Sep 04, 2024
Viaarxiv icon

LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History

Add code
Feb 28, 2024
Figure 1 for LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Figure 2 for LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Figure 3 for LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Figure 4 for LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Viaarxiv icon